Action Recognition in Multimedia Streams

نویسندگان

  • Rozenn Dahyot
  • François Pitié
  • Daire Lennon
  • Naomi Harte
  • Anil C. Kokaram
چکیده

It is well accepted that the rise in the proliferation of inexpensive digital media collection and manipulation devices has motivated the need to access this data by content rather than by keywords. The requirements of content based access are well understood by the digital media research community and there is no need to elaborate further here. Parsing multimedia streams by detection and classification of action implies modeling the dynamic nature of visual and audio features as they evolve in time. The Hidden Markov Model (HMM) has long been used to model dynamic behaviour in audio signals. Its power to capture complex behaviour in that domain has led to widespread use in visual content analysis because of the non-stationarity inherenet in those signals. However, subtleties in the application of HMMs are often unclear in the use of the framework in the visual processing community and the latter portion of this chapter sets out to expose some of these. Three applications are considered to motivate the discussions: actions in sports, observational psychology and illicit video content. Sports: Work in sports media analysis and understanding has been conducted for a decade now with clear motivation provided by the huge amount of sports media broadcasting on internet and digital television. An overview of content analysis for sports footage in general can be found in [22]. Action recognition here involves detection of certain plays and situations as dictated by the game domain e.g. pots, goals, wickets and aces. Illicit Content: The distribution of pornographic materials has also benefited from the digital revolution [6]. This kind of material is illegal in the workplace and is referred to as illicit content in this article. The issue of filtering this material has been of major concern since the introduction of the web in the early 1990’s. Pixalert’s ‘Auditor’ and ‘Monitor’, FutureSoft’s ‘DynaComm i:scan’ 2 and Hyperdyne Software’s ‘Snitch’ all provide image and

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Joint processing of audio and visual information for multimedia indexing and human-computer interaction

Information fusion in the context of combining multiple streams of data e.g., audio streams and video streams corresponding to the same perceptual process is considered in a somewhat generalized setting. Speci cally, we consider the problem of combining visual cues with audio signals for the purpose of improved automatic machine recognition of descriptors e.g., speech recognition/transcription,...

متن کامل

Activation Set: An abstraction for accessing periodic data streams

In distributed multimedia applications time-dependent data streams are conveyed and processed under real-time conditions. The timely accurate activation of the stream handlers processing the data units of streams requires deriving their scheduling times from the temporal properties of the data streams and the amount of data that is processed in each activation. In this paper, we propose the act...

متن کامل

Low-Cost Real-Time Gesture Recognition

A major impediment to developing real-time computer vision systems has been the computational power and level of skill required to process video streams in real-time. This has meant that many researchers have either analysed video streams off-line or used expensive dedicated hardware acceleration techniques. Recent software and hardware developments have greatly eased the development burden of ...

متن کامل

Serveron the Spring Real - Time System 1

An integrated platform which is capable of meeting requirements of both traditional real-time control processing and multimedia processing has enormous potential for accommodating various kinds of new applications. However, few, if any, research or commercial systems successfully provide architectural and OS mechanisms which can eeciently support both deterministic hard real-time computation an...

متن کامل

Multimedia Transaction Management

Current database management system techniques are insuucient to support the management of multimedia data owing to their time-sampled nature. The extension of database systems to support multimedia applications thus requires new mechanisms to ensure the synchronized presentation of multimedia data streams. In order to exibly and eeciently present multimedia data streams to users, media streams ...

متن کامل

Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion

Action recognition is an important yet challenging task in computer vision. In this paper, we propose a novel deepbased framework for action recognition, which improves the recognition accuracy by: 1) deriving more precise features for representing actions, and 2) reducing the asynchrony between different information streams. We first introduce a coarse-to-fine network which extracts shared dee...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008